CDS

Accession Number TCMCG075C21372
gbkey CDS
Protein Id XP_007020793.2
Location complement(join(1813495..1813643,1814276..1814382,1814476..1814601,1814688..1814775,1815290..1815410,1815629..1815701,1815980..1816043,1816159..1816370,1816450..1816514,1816597..1816699,1817699..1817811))
Gene LOC18593486
GeneID 18593486
Organism Theobroma cacao

Protein

Length 406aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007020731.2
Definition PREDICTED: probable beta-1,3-galactosyltransferase 4 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyltransferase 31 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
KEGG_ko ko:K20855        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005794        [VIEW IN EMBL-EBI]
GO:0012505        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGAGTTGGAAGAGTAAAGCACTTGAACCAAATTCAAAGTCTGTGGTGACAAAAAAATGGACTCTGTTGCTTTGCATTGGCTGTTTTTGTGCTGGGATGCTCTTTTCTGATAGAATGTGGGCAGTGCCAGAGGCTGATGATAAAGGTGTATCACGAGAAACAGGAGCTAAAGAGGAAGGGCTAAAGTTAATTACAGAGGGTTGTGATCCAATGCGAAAGGATGTGAAGCGTGAACCAAAGGATATACTAGGGGAAGTTTCAAAGACTCATCATGCTATACAAACACTGGATAAAACAATATCAAATTTGGAGATGGAGTTAGCTGCTGCAAGGGCCGCACAGGAATCTATAATTAATGGTTCTCCCATTTCAGATGACCTGAAAATCCCTGAATCAACTGGGAAGCGGAAATATTTAATGGTTGTAGGCATCAATACTGCTTTTAGCAGCAGAAAACGAAGAGATTCAGTTCGTGCTACTTGGATGCCTCAAGGGAAAGAAAGGAAGAATCTTGAAGAAGAGAAGGGAATCATAATGCGATTTGTAATTGGTCACAGTGCTACCTCAGGGGGTATCCTCGATAGGGCTATTGAAGCAGAAGACAGAAAGCATGGTGACTTTCTGAGGCTGGAGCATGTTGAAGGTTACCTTGAATTATCAGCCAAGACAAAGGCATACTTTGCCACTGCTGCTGCCTTGTGGGATGCTGATTTCTATGTCAAAGTTGATGATGATGTGCATGTAAATATAGCAACACTTGGAGCAACTTTGGTTAGACATCGATCCAAGCCGAGGGTTTATATTGGTTGCATGAAATCTGGTCCAGTTCTTGCTCAAAAGGGAGTAAGATACCACGAACCTGAATATTGGAAATTTGGTGAGGAGGGAAACAAGTATTTCCGCCATGCCACAGGGCAGCTATATGCCATTTCGAAAGATTTGGCTTCCTATATATCCATTAACCAGCATGTGCTGCATAAGTATGCTAATGAAGATGTTTCATTGGGATCATGGTTTATCGGGTTGGACGTAGATCATATAGATGACCGCAGACTGTGCTGTGGTACCACTGATTGTGAGTGGAAGGCTCAAGCAGGCAACATCTGTGTTGCTTCATTCGACTGGACCTGCAGCGGGATTTGCAAGTCAGTTGAGAGGATGAAGGAGGTCCACCGGCGGTGTGGAGAAGACAAGAATGCTTTGTGGAGTGCAGCTTTCTAA
Protein:  
MSWKSKALEPNSKSVVTKKWTLLLCIGCFCAGMLFSDRMWAVPEADDKGVSRETGAKEEGLKLITEGCDPMRKDVKREPKDILGEVSKTHHAIQTLDKTISNLEMELAAARAAQESIINGSPISDDLKIPESTGKRKYLMVVGINTAFSSRKRRDSVRATWMPQGKERKNLEEEKGIIMRFVIGHSATSGGILDRAIEAEDRKHGDFLRLEHVEGYLELSAKTKAYFATAAALWDADFYVKVDDDVHVNIATLGATLVRHRSKPRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYFRHATGQLYAISKDLASYISINQHVLHKYANEDVSLGSWFIGLDVDHIDDRRLCCGTTDCEWKAQAGNICVASFDWTCSGICKSVERMKEVHRRCGEDKNALWSAAF